A Method for Chinese Short Text Classification Considering Effective Feature Expansion
Similar resources
A Method for Chinese Short Text Classification Considering Effective Feature Expansion
This paper presents a Chinese short text classification method that considers extended semantic constraints and statistical constraints. The method uses the “HowNet” tools to build the attribute set of each concept. In the feature expansion step, we judge the collocation between the attribute words of original text and the characteristics before and after expansion as the semantic con...
A Novel One Sided Feature Selection Method for Imbalanced Text Classification
Imbalanced data appear in many areas, such as text classification, credit card fraud detection, risk management, web page classification, image classification, medical diagnosis/monitoring, and biological data analysis. Classification algorithms tend to favor the majority class and may even treat minority-class data as outliers. The text data is one of t...
An Improved CHI Feature Selection Method for Chinese Text Classification
We propose a feature selection method named ICHI, based on an improved CHI statistic. Classification experiments show that ICHI selects features more effectively than the traditional CHI method when used with SVM and KNN classifiers, and that ICHI can improve text classification accuracy and is well suited to feature extraction.
Chinese Short-Text Classification Based on Topic Model with High-Frequency Feature Expansion
Short text differs from traditional documents in its shortness and sparseness. Feature extension can ease the problem of high sparseness in the vector space model, but it inevitably introduces noise. To resolve this problem, this paper proposes a high-frequency feature expansion method based on a latent Dirichlet allocation (LDA) topic model. High-frequency features are extracted from each cate...
An Effective and Robust Method for Short Text Classification
Classification of texts potentially containing a complex and specific terminology requires the use of learning methods that do not rely on extensive feature engineering. In this work we use prediction by partial matching (PPM), a method that compresses texts to capture text features and creates a language model adapted to a particular text. We show that the method achieves a high accuracy of te...
Journal
Journal title: International Journal of Advanced Research in Artificial Intelligence
Year: 2012
ISSN: 2165-4069,2165-4050
DOI: 10.14569/ijarai.2012.010101